Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech

Authors

  • Fuping Pan
  • Qingwei Zhao
  • Yonghong Yan
Abstract

This paper describes a tone pronunciation scoring system for Mandarin. The system recognizes the tone of each syllable with GMMs and uses the recognition results for tone assessment. Initial experimental results on strongly accented speech were poor, for two reasons: inaccurate forced alignment leads to incomplete F0 contours, and the F0 contours of such speech follow special patterns. We propose four measures to address these problems. The first is to make the extraction of the F0 contour independent of the forced alignment. The second is to base the scoring on GMM posterior probabilities. The third is to train the GMMs on speech with the same accent. The last is to train fractionized (finer-grained) bi-tone GMMs to cover tone changes in multiple-character words. With these measures in place, the tone scoring correct rate improves from 60.2% to 83.3%.
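The second measure, scoring on GMM posterior probabilities, can be illustrated with a minimal sketch. The abstract does not give the feature set, the number of mixture components, or the decision threshold, so everything concrete below is an assumption made for illustration: the two-dimensional feature vector (F0 slope, normalized F0 height), the single-component toy models in `TOY_TONE_GMMS`, the uniform tone priors, and the 0.5 threshold. It is not the authors' implementation.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))

def gmm_log_likelihood(x, gmm):
    """Log likelihood of x under a GMM given as [(weight, mean, var), ...]."""
    return log_sum_exp(
        [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    )

def tone_posteriors(x, tone_gmms):
    """Posterior P(tone | x) via Bayes' rule, assuming uniform tone priors."""
    log_prior = -math.log(len(tone_gmms))
    joint = {t: log_prior + gmm_log_likelihood(x, g) for t, g in tone_gmms.items()}
    norm = log_sum_exp(list(joint.values()))
    return {t: math.exp(lj - norm) for t, lj in joint.items()}

def score_tone(x, target_tone, tone_gmms, threshold=0.5):
    """Return (posterior of the target tone, accepted?).

    The threshold is a hypothetical value, not one taken from the paper.
    """
    p = tone_posteriors(x, tone_gmms)[target_tone]
    return p, p >= threshold

# Hypothetical toy models: one Gaussian per tone over
# (F0 slope, normalized F0 height). Real models would be trained on data.
TOY_TONE_GMMS = {
    1: [(1.0, [0.0, 1.0], [0.1, 0.1])],    # high level
    2: [(1.0, [1.0, 0.0], [0.1, 0.1])],    # rising
    3: [(1.0, [0.0, -1.0], [0.1, 0.1])],   # low
    4: [(1.0, [-1.0, 0.0], [0.1, 0.1])],   # falling
}

if __name__ == "__main__":
    features = [-0.9, 0.1]  # a clearly falling contour
    for tone in sorted(TOY_TONE_GMMS):
        p, ok = score_tone(features, tone, TOY_TONE_GMMS)
        print(f"target tone {tone}: posterior={p:.4f} accepted={ok}")
```

Normalizing over all tone models makes the score a relative judgment between competing tones rather than a raw likelihood, which is one plausible reason a posterior-based score is less sensitive to a mismatch between the speaker and the training data.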

Similar resources

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

In this paper we present our investigation into improving the performance of our computer-assisted language learning (CALL) system through exploiting the acoustic model and features within the speech recognition framework. First, to alleviate channel distortion, speaker-dependent cepstrum mean normalization (CMN) is adopted and the average correlation coefficient (average CC) between machine and...

iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent

We present iCALL, a speech corpus designed to evaluate Mandarin Chinese pronunciation patterns of non-native speakers of European descent, developed at the Institute for Infocomm Research (I²R) in Singapore. To the best of our knowledge, iCALL is larger than any reported non-native corpora to date in terms of utterance number, duration, and number of speakers: iCALL consists of 90,841 utterances...

SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese

We present SingaKids-Mandarin, a speech corpus of 255 Singaporean children aged 7 to 12 reading Mandarin Chinese, for a total of 125 hours of data (75 hours of speech) and 79,843 utterances. This corpus is phonetically balanced and detailed in human annotations, including phonetic transcriptions, lexical tone markings, and proficiency scoring at the utterance level. The reading scripts span a d...

Robust automatic speech recognition for accented Mandarin in car environments

This paper addresses the issues of robust automatic speech recognition (ASR) for accented Mandarin in car environments. A robust front-end is proposed, which adopts a Minimum Mean-Square Error (MMSE) estimator to suppress the background noise in frequency domain, and then implements spectrum smoothing both in time and frequency index to compensate those spectrum components distorted by the nois...

Accent detection and speech recognition for Shanghai-accented Mandarin

As speech recognition systems are used in ever more applications, it is crucial for the systems to be able to deal with accented speakers. Various techniques, such as acoustic model adaptation and pronunciation adaptation, have been reported to improve the recognition of non-native or accented speech. In this paper, we propose a new approach that combines accent detection, accent discriminative...

Publication date: 2006